SEQUENCE LISTING

<110> Merck & Co., Inc. 

Jansen, Kathrin U. 
Schultz, Loren D. 
Neeper, Michael P. 
Markus, Henry Z. 

<120> OPTIMIZED EXPRESSION OF HPV 31 L1 IN YEAST

<130> 21188-PCT 

<150> 60/457,172 
<151> 2003-03-24 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1515 
<212> DNA 

<213> HPV31 L1 wild-type
<400> 1 

atgtctctgt ggcggcctag cgaggctact gtctacttac cacctgtccc agtgtctaaa 60 
gttgtaagca cggatgaata tgtaacacga accaacatat attatcacgc aggcagtgct 120 
aggctgctta cagtaggcca tccatattat tccataccta aatctgacaa tcctaaaaaa 180 
atagttgtac caaaggtgtc aggattacaa tatagggtat ttagggttcg tttaccagat 240 
ccaaacaaat ttggatttcc tgatacatct ttttataatc ctgaaactca acgcttagtt 300
tgggcctgtg ttggtttaga ggtaggtcgc gggcagccat taggtgtagg tattagtggt 360
catccattat taaataaatt tgatgacact gaaaactcta atagatatgc cggtggtcct 420 
ggcactgata atagggaatg tatatcaatg gattataaac aaacacaact gtgtttactt 480 
ggttgcaaac cacctattgg agagcattgg ggtaaaggta gtccttgtag taacaatgct 540 
attacccctg gtgattgtcc tccattagaa ttaaaaaatt cagttataca agatggggat 600 
atggttgata caggctttgg agctatggat tttactgctt tacaagacac taaaagtaat 660 
gttcctttgg acatttgtaa ttctatttgt aaatatccag attatcttaa aatggttgct 720 
gagccatatg gcgatacatt atttttttat ttacgtaggg aacaaatgtt tgtaaggcat 780 
ttttttaata gatcaggcac ggttggtgaa tcggtcccta ctgacttata tattaaaggc 840 
tccggttcaa cagctacttt agctaacagt acatactttc ctacacctag cggctccatg 900 
gttacttcag atgcacaaat ttttaataaa ccatattgga tgcaacgtgc tcagggacac 960 
aataatggta tttgttgggg caatcagtta tttgttactg tggtagatac cacacgtagt 1020 
accaatatgt ctgtttgtgc tgcaattgca aacagtgata ctacatttaa aagtagtaat 1080
tttaaagagt atttaagaca tggtgaggaa tttgatttac aatttatatt tcagttatgc 1140 
aaaataacat tatctgcaga cataatgaca tatattcaca gtatgaatcc tgctattttg 1200 
gaagattgga attttggatt gaccacacct ccctcaggtt ctttggagga tacctatagg 1260 
tttgtaacct cacaggccat tacatgtcaa aaaagtgccc cccaaaagcc caaggaagat 1320 
ccatttaaag attatgtatt ttgggaggtt aatttaaaag aaaagttttc tgcagattta 1380 
gatcagtttc cactgggtcg caaattttta ttacaggcag gatatagggc acgtcctaaa 1440 
tttaaagcag gtaaacgtag tgcaccctca gcatctacca ctacaccagc aaaacgtaaa 1500 
aaaactaaaa agtaa 1515 

<210> 2 
<211> 1515 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 31 partial rebuild 
<400> 2

atgtctctgt ggcggcctag cgaggctact gtctacttac cacctgtccc agtgtctaaa 60
gttgtaagca cggatgaata tgtaacacga accaacatat attatcacgc aggcagtgct 120
aggctgctta cagtaggcca tccatattat tccataccta aatctgacaa tcctaaaaaa 180
atagttgtac caaaggtgtc aggattacaa tatagggtat ttagggttcg tttaccagat 240
ccaaacaaat ttggatttcc tgatacatct ttttataatc ctgaaactca acgcttagtt 300
tgggcctgtg ttggtttaga ggtaggtcgc gggcagccat taggtgtagg tattagtggt 360
catccattat taaataaatt tgatgacact gaaaactcta atagatatgc cggtggtcct 420
ggcactgata atagggaatg tatatcaatg gattataaac aaacacaact gtgtttactt 480
ggttgcaaac cacctattgg agagcattgg ggtaaaggta gtccttgtag taacaatgct 540
attacccctg gtgattgtcc tccattagaa ttaaaaaatt cagttataca agatggggat 600
atggttgata caggctttgg agctatggat tttactgctt tacaagacac taaaagtaat 660
gttcctttgg acatttgtaa ttctatttgt aaatatccag attatcttaa aatggttgct 720
gagccatacg gcgacacctt gttcttctat ttgcgtagag aacagatgtt cgtaaggcac 780
ttcttcaaca gatccggcac cgtaggtgaa tctgtcccaa ccgacctgta catcaagggc 840
tccggttcca ccgctaccct ggctaactcc acctacttcc caactccatc tggctccatg 900
gtcacctccg acgctcagat cttcaacaag ccatactgga tgcagcgtgc acagggtcac 960
aacaacggta tctgttgggg taaccagctg ttcgtgactg tggtcgatac cacgcgttct 1020
accaacatgt ctgtctgtgc tgcaatcgct aactctgaca ctaccttcaa gtcctctaac 1080
ttcaaggagt acctgagaca tggtgaggaa ttcgatctgc aattcatctt ccagttgtgc 1140
aagatcaccc tgtctgctga catcatgacc tacatccaca gtatgaaccc tgccatcctg 1200
gaggactgga acttcggtct gaccactcca ccttccggtt ctttggagga tacctatagg 1260
tttgtaacct cacaggccat tacatgtcaa aaaagtgccc cccaaaagcc caaggaagat 1320
ccatttaaag attatgtatt ttgggaggtt aatttaaaag aaaagttttc tgcagattta 1380
gatcagtttc cactgggtcg caaattttta ttacaggcag gatatagggc acgtcctaaa 1440
tttaaagcag gtaaacgtag tgcaccctca gcatctacca ctacaccagc aaaacgtaaa 1500
aaaactaaaa agtaa 1515



<210> 3 
<211> 1515 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 31 total rebuild 



<400> 3 

atgtctttgt ggagaccatc tgaagctacc gtctacttgc caccagtccc agtctctaag 60
gtcgtctcta ccgacgaata cgtcaccaga accaacatct actaccacgc tggttctgct 120
agattgttga ccgtcggtca cccatactac tctatcccaa agtctgacaa cccaaagaag 180
atcgtcgtcc caaaggtctc tggtttgcaa tacagagtct tcagagtcag attgccagac 240
ccaaacaagt tcggtttccc agacacctct ttctacaacc cagaaaccca aagattggtc 300
tgggcttgtg tcggtttgga agtcggtaga ggtcaaccat tgggtgtcgg tatctctggt 360
cacccattgt tgaacaagtt cgacgacacc gaaaactcta acagatacgc tggtggtcca 420
ggtaccgaca acagagaatg tatctctatg gactacaagc aaacccaatt gtgtttgttg 480
ggttgtaagc caccaatcgg tgaacactgg ggtaagggtt ctccatgttc taacaacgct 540
atcaccccag gtgactgtcc accattggaa ttgaagaact ctgtcatcca agacggtgac 600
atggtcgaca ccggtttcgg tgctatggac ttcaccgctt tgcaagacac caagtctaac 660
gtcccattgg acatctgtaa ctctatctgt aagtacccag actacttgaa gatggtcgct 720
gaaccatacg gcgacacctt gttcttctac ttgcgtagag aacagatgtt cgtaaggcac 780
ttcttcaaca gatccggcac cgtaggtgaa tctgtcccaa ccgacctgta catcaagggc 840
tccggttcca ccgctaccct ggctaactcc acctacttcc caactccatc tggctccatg 900
gtcacctccg acgctcagat cttcaacaag ccatactgga tgcagcgtgc acagggtcac 960
aacaacggta tctgttgggg taaccagctg ttcgtgactg tggtcgatac cacgcgttct 1020
accaacatgt ctgtctgtgc tgcaatcgct aactctgaca ctaccttcaa gtcctctaac 1080
ttcaaggagt acctgagaca tggtgaggaa ttcgatctgc aattcatctt ccagttgtgc 1140
aagatcaccc tgtctgctga catcatgacc tacatccaca gtatgaaccc tgccatcctg 1200
gaggactgga acttcggtct gaccactcca ccttccggtt ctttggaaga cacctacaga 1260
ttcgtcacct ctcaagctat cacctgtcaa aagtctgctc cacaaaagcc aaaggaagac 1320
ccattcaagg actacgtctt ctgggaagtc aacttgaagg aaaagttctc tgctgacttg 1380
gaccaattcc cattgggtag aaagttcttg ttgcaagctg gttacagagc tagaccaaag 1440
ttcaaggctg gtaagagatc tgctccatct gcttctacca ccaccccagc taagagaaag 1500
aagaccaaga agtaa 1515



<210> 4 
<211> 504 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> HPV 31 L1



<400> 4

Met Ser Leu Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1               5                   10                  15

Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn
            20                  25                  30

Ile Tyr Tyr His Ala Gly Ser Ala Arg Leu Leu Thr Val Gly His Pro
        35                  40                  45

Tyr Tyr Ser Ile Pro Lys Ser Asp Asn Pro Lys Lys Ile Val Val Pro
    50                  55                  60

Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp
65                  70                  75                  80

Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Glu Thr
                85                  90                  95

Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln
            100                 105                 110

Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Phe Asp
        115                 120                 125

Asp Thr Glu Asn Ser Asn Arg Tyr Ala Gly Gly Pro Gly Thr Asp Asn
    130                 135                 140

Arg Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Leu
145                 150                 155                 160

Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys
                165                 170                 175

Ser Asn Asn Ala Ile Thr Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys
            180                 185                 190

Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala
        195                 200                 205

Met Asp Phe Thr Ala Leu Gln Asp Thr Lys Ser Asn Val Pro Leu Asp
    210                 215                 220

Ile Cys Asn Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ala
225                 230                 235                 240

Glu Pro Tyr Gly Asp Thr Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met
                245                 250                 255

Phe Val Arg His Phe Phe Asn Arg Ser Gly Thr Val Gly Glu Ser Val
            260                 265                 270

Pro Thr Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Thr Leu Ala
        275                 280                 285

Asn Ser Thr Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp
    290                 295                 300

Ala Gln Ile Phe Asn Lys Pro Tyr Trp Met Gln Arg Ala Gln Gly His
305                 310                 315                 320

Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp
                325                 330                 335

Thr Thr Arg Ser Thr Asn Met Ser Val Cys Ala Ala Ile Ala Asn Ser
            340                 345                 350

Asp Thr Thr Phe Lys Ser Ser Asn Phe Lys Glu Tyr Leu Arg His Gly
        355                 360                 365

Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu
    370                 375                 380

Ser Ala Asp Ile Met Thr Tyr Ile His Ser Met Asn Pro Ala Ile Leu
385                 390                 395                 400

Glu Asp Trp Asn Phe Gly Leu Thr Thr Pro Pro Ser Gly Ser Leu Glu
                405                 410                 415

Asp Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Ser
            420                 425                 430

Ala Pro Gln Lys Pro Lys Glu Asp Pro Phe Lys Asp Tyr Val Phe Trp
        435                 440                 445

Glu Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro
    450                 455                 460

Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Tyr Arg Ala Arg Pro Lys
465                 470                 475                 480

Phe Lys Ala Gly Lys Arg Ser Ala Pro Ser Ala Ser Thr Thr Thr Pro
                485                 490                 495

Ala Lys Arg Lys Lys Thr Lys Lys
            500



<210> 5 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer
<400> 5 

cgtcgacgta aacgtgtatc atattttttt acag 34

<210> 6 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer
<400> 6 

cagacacatg tattacatac acaac 25

<210> 7 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer
<400> 7 

ctcagatctc acaaaacaaa atgtctctgt ggcggcctag c 41

<210> 8 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer



<400> 8 

gacagatctt actttttagt ttttttacgt tttgctgg 38



